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TO ALL WHOM IT MAY CONCERN: 

Be it known that we, David C. Feldmeier and Tyler G. Arnold, both citizens of the 
United States of America, residing in Morristown, County of Morris, State of New Jersey, 
and Long Valley, County of Morris, State of New Jersey, respectively, have made a certain 
new and useful invention in a PARTIALLY-ORDERED CAMS USED IN TERNARY 
HIERARCHICAL ADDRESS SEARCHING/SORTING of which the following is a 
specification. 



SPECIFICATION 
FIELD OF THE INVENTION 

The invention relates to digital hierarchical address filtering and more specifically 
to methods for increasing the rate at which hierarchical address filtering, such as the 
network address contained in a packet, can be performed, 

BACKGROUND OF THE INVENTION 

Address translation is the process of mapping an address, such as the network 
address contained in a packet, to some desired information. Examples of desired 
information include determining the output port of a switch to which a packet should be 
sent, or determining the address of the next-hop router for the routing of IP (Internet 
Protocol) datagrams. Address filtering is similarto address translation, except that instead 
of finding the data associated with some address, it is only important to determine whether 
a given address exists in a table of addresses. Throughout this disclosure, the phrase 
"address translation" encompasses both address translation and address filtering. 

Addresses can be placed into two categories with regard to processing: flat and 
hierarchical. Flat addresses are those addresses that have no internal structure that is 
useful from a processing aspect. Ethernet addresses are an example of a flat address. 
Although Ethernet addresses have some structure to them (e.g., one part of the Ethernet 
address denotes the manufacturer of the equipment using the given Ethernet address), the 
structure is not relevant to protocol processing operations, such as routing. Many • 
techniques have been developed for accelerating flat address translation. 

Hierarchical addresses are those addresses that have an internal structure that is 
useful for protocol processing. Examples of hierarchical addresses include IPv4 
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addresses, IPv6 addresses, E. 1 64 addresses (used in ATM-asynchronous transfer mode), 
and telephone numbers. To better illustrate the internal structure of a hierarchical address, 
a telephone number is used . Consider a telephone number: (908) 979-1 01 0. The highest 
level of the hierarchy is denoted by the "area code" 908. The next level of the hierarchy 
is the "central office code" 979. The lowest level of the hierarchy is the "station number" 
1010. The hierarchical structure of the telephone number is used to determine how to 
route a call through the telephone network. For example, if a call both originates and 
terminates in the 979 central office, then the phone call passes only through the 979 
central office. If a call both originates and terminates in the 908 area, then no long- 
distance carrier is used to carry the call. Note that a flat address is a hierarchical address 
with a single level of hierarchy. Thus, any address translation techniques that operate on 
hierarchical address can be applied to flat addresses as well. 

The main importance of the structure of hierarchical addresses is that there is no 
need to store information about every single address in order to be able to process all 
addresses. Information about entire classes of addresses can be stored in a single entry. 
For example, for a call that originates from area code 908 and terminates in some other 
area code, the correct action is to forward the call to a long distance carrier, regardless of 
the specific terminating area code. 

The tables used for hierarchical address translation can be thought of as a sieve. 
Consider the following example, as shown in Fig. 1 , of a translation table that might exist • 
in the 979 central office switch (X's represent "wildcard" or "don't care" values that match 
all digits). Telephone numbers are compared with table entries in order from top to bottom 
looking for a matching entry. The telephone number 908 979 1035 would match entry A 



in the table, and the call would be routed to the station number 1035. The telephone 
number 201 829 5136 matches entry D of the table and would be routed to a long distance 
carrier for transport. Note that any telephone number that matches an entry also may 
match some of the entries that follow. The fact that entries are examined in order assures 
5 that routine is done correctly. 

Other methods can be used to search the table that will provide the same result as 
if the table were searched serially. One technique is to use a tree structure, such as a 
PATRICIA (Practical Algorithm To Retrieve Information Coded In Alphanumeric) tree. The 
Art of Computer Programming (Knuth): Introduction to Algorithms (Cormen, Leiserson, and 

10 Rivest). The search begins at the root of the tree, which corresponds to the top of the 
hierarchy. Fig. 2 depicts such a search method and comprises the table of Fig. 1 re- 
arranged as shown in Fig. 2. The first comparison would be against just the area code 908. 
If 908 were found, then the entries indented under 908 would be searched. Otherwise, the 
call will match the last line and be routed to a long distance Point-of-Presence. 

15 Another alternative would be to search all entries in parallel. As shown in Fig. 3, the 

table can be augmented with explicit hierarchical levels assigned to each table entry. 
The telephone number 908 979 1035 matches table entries A, C and D. The desired 
match is entry A, which has the highest level number (3 in this case, i.e., only two entries 
need to be delineated A & B. The word "address" as used above may actually be a 

20 combination of data fields from a packet header that is treated as a single address for • 
purposes of address translation. For example, the address may be a combination of 
source address, destination address and priority fields. It should be noted that any 
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combination of different header elements forms a single hierarchical address, in which the 
different elements form part of the structure of the hierarchical address. 

However, these approaches are limited by the constraints of software processing 
speeds and comparisons with table entries are typically performed in sequential order. 
Thus, as network communication speeds increase, it becomes necessary to find high- 
speed translation techniques to operate at wirespeed. 

One way to achieve wirespeed is to utilize a content-addressable memory (CAM). 
CAMs differ from conventional RAM (random access memory), SRAM (static RAM) and 
DRAM (dynamic RAM) in that they are organized differently. In a CAM, data is stored in 
locations in an arbitrary fashion. The locations can be selected by an address bus, or the 
data can be written directly into the first empty location because every location has a 
special status bit that keeps track of whether the location has valid information in it or is 
empty and available for overwriting. Once- information is stored in a memory location, it 
is found by comparing every bit in memory with data placed in a special Comparand 
register. CAMs also comprise a mask register which allows selection of which bits will 
participate in the comparison. If there is a match for every bit in a location with every 
corresponding bit in the Comparand register, a Match Flag is asserted to the user know 
that the data in the Comparand register was found in memory. A priority encoder sorts out 
which matching location has the top priority, if there is more than one, and makes the 
address of the matching location available to the user. Thus, a CAM is a memory device • 
that allows retrieval of information by specifying part of the stored information, rather than 
by specifying a storage address. For example, using masking, if the entries hex "abed," 
"abde" and "accd" were stored in a CAM, the CAM could be instructed to return the 



complete contents of the entries of all locations beginning "ab. M In this example, the CAM 
would return entries "abed" and "abde." 

CAMs are generally classified as either binary or ternary CAMs. Binary CAMs store 
binary entries, while ternary CAMs store ternary entries (see PCT Application No. 
PCT/US97/13216, WO98/07160). Binary entries are entries that contain only 0 or 1 
values, while ternary entries are entries that contain 0, 1, or X (i.e., "don't care") values. 
Note that a single ternary entry can be expressed as two or more binary entries. In other 
words, a single ternary entry "1X0" can be represented by two binary entries "110" or "1 00", 
or a single ternary entry "1 XX" can be represented by four binary entries "1 00", "101", "1 1 0" 
or "111", etc. Since hierarchical addresses often comprise ternary values (e.g., a 
telephone lookup table number "908-979-XXXX"), ternary CAMs require a smaller number 
of table entries to represent each hierarchical address than binary CAMs. However, 
ternary CAMs require more complex hardware and are generally more expensive than 
binary CAMs. (see U.S. Patent No. 5,319,590 (Montoye)). In particular, one method to 
construct a ternary CAM is to design the CAM cells and sense amps to enable the storage 
of three distinct voltage levels (high, medium and low) which would correspond to the 
ternary values. 

The following provides a summary of existing searching methods using CAMs. 
Expanding Each Ternary Entry into Multiple Binary Entries 

While hierarchical addresses can be directly stored in ternary CAMs, in order to be • 
stored in binary CAMs they must first be translated into binary format. As discussed 
above, a ternary address can be translated into two or more binary addresses. However, 
the number of binary addresses needed to represent a ternary address is 2 m , where m is 



the number of bits required to represent each "don't care" value (X) in the ternary address. 

For example, the ternary address ABCD_123X hex , where "X" indicates the "don't care" 

value, covers the set of binary addresses given by {ABCD_1230 hex ;ABCDjl231 hex ;... 

ABCD_123F hex } where m=4 to cover the range 0 to F. Hence, the ternary address 

ABCDJ23X hex requires 16 (i.e., 2 4 ) entries in the binary CAM. To that end, in accordance 

with A.S.N. 08/818,073, filed on March 14, 1997, entitled "Accelerated Hierarchical 

Address Filtering and Translation," (now U.S. Patent No. 5,920,886), which is assigned 

to the same assignee as the present invention, namely, Music Semiconductors, Inc., and 

all of whose disclosure is incorporated by reference herein, there is disclosed a method for 

storing such entries in a binary CAM acting as a ternary CAM but not having to store these 

entries in sequential order. 

Hierarchical Level Field Searches 

Another search technique involves- the use of entries in a ternary CAM with a 

hierarchical level field, using a linear search of this level field. Fast Routing Table Lookup 

Using CAMs % (McAuley and Francis, 1993). In particular, this technique uses a ternary 

CAM to store ternary addresses and wherein each ternary address comprises an 

associated level field. The entries may be stored in the CAM in any order. The following 

is an example of how an address and mask are stored. The ternary address, 

ABCD_123X hex /28 (where "28" indicates ^the number of contiguous ones in the mask), 

would be stored in the CAM as: 

ternary field = ABCD_123X hex 

priority field = 111 00 binary (where 1 1 1 00 bjnary = 28 dedmal ) 



The technique then begins searching from the lowest priority level (i.e., the level with the 
longest network mask, viz., 11111 bjnary ) and working towards the higher levels, depending 
upon whether there are any matches. The search is altered based on the level field. The 
hierarchical address is not masked during the search and only parts of the level field are 
5 masked. If a match is found, the search is complete; if there are no matches, the level field 
is decremented and the search continues. The worst case number of searches is the 
number of distinct hierarchical levels, N. 

Entries in Ternary CAMs with Hierarchical Level Field. Binary Search of Level Field 

Another technique, that is an improvement of the Hierarchical Level Field Searches, 

10 (described above), is disclosed in A.S.N. 08/818,073, filed on March 14, 1997, entitled 
"Accelerated Hierarchical Address Filtering and Translation", (now U.S. Patent No. 
5,920,886), which is also assigned to the same assignee as the present invention, namely, 
Music Semiconductors, Inc., and all of whose disclosure is incorporated by reference 
herein. In this technique, instead of starting at the lowest level of the hierarchy (comprising 

15 N hierarchical levels) and working up (i.e., Hierarchical Level Field Searches), this search 
technique starts in the middle and eliminates from consideration one half of the remaining 
hierarchical levels during each iteration. In particular, the search starts in the middle mask 
(i.e., MM=N/2) and, as an example, N=32. Depending upon the search result, the search 
either moves up or down MM/2 mask levels and tries again. The search begins looking for 

20 matches with the level field 1XXX, which will match against priority levels 16-31 since: 
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31^= 11111 

If there are multiple matches in this range (16-31), the mask is changed to 11XXX (i.e., 
replace the most significant X of the priority mask with a "1"). By doing so, the search 
range is halved, resulting in a new smaller range, viz., 24-31 . After searching with the new 
mask, if there is one match, the search is complete; if there is no match, the least 
significant 1 of the priority mask is replaced with a 0 (i.e., the new mask is now 10XXX) so 
that the range of 16-23 is now searched. 

If, on the other hand, there are no matches in this range (1 6-31 ) when the original 
mask 1XXX was used, the mask is changed to OXXX which means that the search is now 
looking at the range 0-15 of the priority masks, since: 

Ocecimal = 00000 
1decimal= 00001 / 

8decimal= 01000' 
• . - 

15— = 01111 

After searching with the new mask, if there is one match, the search is complete; 
otherwise, the search process is repeated by replacing the most significant X with a 1 (i.e., 
01XX, searching the range 8-15). If there is no match, the least significant 1 is replaced 
with a 0 (i.e., 00XX, searching The range 0-14) and the process repeated until a match is 
found. 

This technique is depicted in Fig. 4 and can be summarized as follows: The ternary 
CAM is first searched for an address and a priority field having a 1 in the most significant 
bit position and an X in all other bit positions, in stage 2. Stage 4 then determines if there 



are any matching entries, in which case, the operation proceeds to stage 8; otherwise, the 
least significant bit in the priority field having a value of 1 is replaced by a value of 0 in 
stage 6. Stage 8 then determines whether any bits of the priority field have a value of X, 
in which case, the most significant bit in the priority field having a value of X is replaced by 
5 a value of 1 in stage 10. The CAM is then searched for the address and the modified 
priority field, in stage 12. Stage 14 determines whether there is a single matching entry, 
in which case, the matching entry is retrieved from the CAM in stage 1 6; otherwise, stages 
4-14 are repeated until the test of stage 14 is satisfied and the operation terminates. Thus, 
one X is resolved (i.e., replaced by a 1 or a 0) after each search until a matching entry is 
10 found. 

This technique requires, in the worst case, a number of searches equal to the 
number of bits used to represent the priority field (i.e., if N is the number of hierarchical 
levels represented by the priority field, log 2 N searches are required to find a matching entry 
at the lowest hierarchical level, as all bits of the priority mask must be resolved). 
15 Full-Sorted Order with Different Masks 

Another technique disclosed in Fast Routing Table Lookup Using CAMs . (McAuley 
and Francis, 1993) comprises the storage of entries in full-sorted order in a binary CAM 
with linear searching of the hierarchy using different mask registers. Binary entries are 
stored in a binary CAM and the CAM entries must be ordered inversely by level of 
20 hierarchy. Instead of using priority masks, this technique masks off the address being • 
searched in a manner similar to how the priority masks are decremented, as described 
above under "Hierarchical Level Field Searches". In particular, this technique (shown in 
Fig. 5) starts with the most precise mask of all "care" bits in the mask register and searches 
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the CAM through the mask register. For example, for a 32-bit address search, the 

technique would begin with all "O's" in the mask register (a /32 mask), where "0"s 

correspond to "care" and "1 "s correspond to "don't cares" in a CAM search through a mask. 

If there is a match, the search is completed and the top CAM match (i.e., the lowest CAM 

5 address) is selected. If there is no match, the technique moves down to the next highest 

mask value, which is /31 in this case. The search is repeated again through the mask 

register, repeating the process until a single match is received. The worst case number 

of searches is N searches, where N is the number of hierarchical levels. 

Entries in Full-Sorted Order in Binary CAM with Binary Search of Hierarchy Using Different 
10 Mask Registers 

This technique, as disclosed in A.S.N. 08/818,073, filed on March 14, 1998, entitled 
"Accelerated Hierarchical Address Filtering and Translation", (now U.S. Patent No. 
5,920,886), is an improvement to Full-Sorted Order with Different Masks (discussed above) 
by providing a search to log 2 N. This technique also starts with the median, or half-way 

15 mask, which would be the /1 6 mask in a 32 mask level address scheme. If there is a single 
match, the search is complete. If there are multiple matches, the search moves to the 
mask half-way up, which is the /24 mask in this iteration and the search continues. If there 
are still no matches, the search moves the other way and half-way down, i.e., to the /8 
mask in this iteration and the search continues. This process is repeated until one match 

20 is found or until no matches are found, in which case the highest priority match from the • 
last multiple match search is selected. 
Full-Sorted Order 
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This technique is disclosed by Fast Routing Table Lookup Using CAMs . (McAuley 
and Francis, 1993) and is shown in Fig. 6. In this technique, ternary entries are stored in 
sorted, i.e., inverse hierarchical, order. A search takes a single cycle because the search 
obtains the longest match address which is stored at the lowest CAM address. The major 
advantage of this approach is the ideal one search cycle address resolution time. On the 
other hand, the downside of this technique is the required resorting which makes this 
technique impractical. 
IP Classless Inter Domain Routing (CI PR) 

The present invention can be implemented in any type of communication system 
where hierarchical addressing and associated address masks are used. As an example, 
as will be disclosed below, the present invention can operate on the IP Classless Inter 
Domain Routing (CIDR) to find the "longest match" in an IP routing table. CIDR is a 
common method of reducing address table size, which categorizes address aggregations 
on arbitrary mask bit boundaries. CIDR reduces the size of IP tables by making it possible 
to group thousands of entries under one entry in the CIDR table. Fig. 7 shows two 
examples of how the 32-bit network mask values are structured according to Internet 
Request For Comments 940 and 1519. The masks always have a structure consisting of 
contiguous "0"s on the right. A mask never has "1 "s and "0"s interleaved, or "0"s on the left 
and "1"s on the right. Fig. 8 shows examples of invalid mask structures. 

Fig. 9 shows four entries from a CIDR routing table (in hexadecimal format) and the . 
network masks that relate to them. It can be seen that the mask boundaries are clearly 
demarcated. The number after the backslash indicates the number of continuous ones in 
the mask. For example, the first entry's mask is FF.FF.FF.FF, which corresponds to 32 bits 
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set to 1 . This mask is an IP mask, where a 1 indicates a "care" location and a 0 indicates 
a "don't care." A "care" bit means that the corresponding bit locations of the numbers being 
compared must be identical. A "don't care" bit indicates that the bit locations are not used 
in the comparison. The more the number of 1 -bits or "cares" in a mask, the less the number 
of possible matches. Therefore, a higher mask number is more precise than a lower mask 
number. Note, an IP address search may match multiple CIDR routing table entries 
because masking is used to further specify smaller address ranges. 

In Fig. 9 , the last address/mask entry actually corresponds to 1 1 00 0000 0001 1 000 
XXXX XXXX XXXX XXXX where the X's stand for "don't cares." Any search value whose 
top 16 bits equal 1 100 0000 0001 1000 would match the last entry. Note that in Fig. 9, the 
entries are ordered by mask value from most 1s (FF.FFIFF.FF) to the entry with the least 
number of 1s (FF.FF.00.00, in this case). A search for C0.18.0C.15 would yield matches 
on all the entries but the first entry is chosen because it is the match with the longest group 
of 1s in the mask (the so-called "longest match"). A search for C0.18.0C.16 would match 
only the second, third, and fourth entries. 

When multiple matches occur during a search, CAMs select the "best" entry by their 
physical addresses. Matches with lower addresses are selected over matches with higher 
addresses. For example, if a search yields matches at CAM address 1 and 2, the match 
at address 1 is selected. In a typical layer 2 address filtering application, the order of the 
entries in the CAM is unimportant because it is expected that there will be no more than . 
one match result from any given search. 

However, hierarchical address structures like IPv4, IPv4 CIDR, and IPv6 differ by 
having the possibility of having multiple matches during a search operation. The best 
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match is the "longest match," or the entry has the largest number of contiguous "cares" or 
1s. Therefore, if a CAM is used to store hierarchical addresses, sorting is necessary. There 
are two alternatives for this sorting: sorting when inserting entries, or sorting when reading 
matches. Fast hardware lookup for high bandwidth routing and switching places an 
emphasis on high-speed searches. Choosing to sort when inserting entries is therefore 
a logical decision because it enables fast constant time address resolution. 

Thus, as network communication speeds increase, there remains a need to conduct 
hierarchical address translation at wirespeed, rather than relying on software techniques 
as is practiced in conventional translation techniques, or on present CAM techniques. In 
particular, there remains a need for a method of sorting entries in the CAM to obtain the 
longest match in the shortest amount of time. 

OBJECTS OF THE INVENTION 

Accordingly, it is the general object of this invention to provide an apparatus which 
improves upon and overcomes the disadvantages of the prior art. 

It is another object of this invention to provide faster hierarchical address translation 
to most any hierarchical data. 

It is another object of this invention to provide faster hierarchical address translation 
by utilizing address resolution/search that is always completed in one CAM search cycle. 

It is still another object of this invention to provide an apparatus and method having 
a sorting mechanism whose worst case cost for inserting new entries is proportional to the 
number of masks and is not proportional to the number of entries. 
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It still yet another object of this invention to provide an apparatus and method 
whereby entries only need be sorted by mask levels which means that ordering inside 
mask levels is irrelevant. 

It is even yet a further object of this invention to provide an apparatus and method 
using a sorting method whereby table updates can be successfully interrupted by searches 
as long as the current maintenance atomic operation (e.g., an insert operation) is allowed 
to complete. 

It is still yet a further object of this invention to provide an apparatus and method 
whereby each binary code pair, used to represent a ternary value, are stored at n and 
n+32, thus making the binary to ternary conversion easy to compute using a general 
purpose processor. 

It is still yet a further object of this invention to provide an apparatus and method 
whereby the binary encoded ternary conversion is accomplished inside the CAM itself to 
reduce the input/output bandwidth. 

It is another object of this invention to provide an apparatus and method for faster 
hierarchical address translation for IP datagram routers. 

It is still another object of this invention to provide an apparatus and method for 
faster hierarchical address translation for layer-3 switches. 

It is still yet another object of this invention to provide an apparatus and method for 
faster hierarchical address translation in ATM switches during connection setup for the • 
translation of hierarchical E.164 addresses. 

It is still yet another object of this invention to provide an apparatus and method for 
faster hierarchical address translation for data communication. 
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It is still yet another object of this invention to provide an apparatus and method for 
faster hierarchical address translation for multimedia communication. 

It is still yet another object of this invention to provide an apparatus and method for 
faster hierarchical address translation for integrated service communication. 

It is even a further object of this invention to provide an apparatus and method for 
the encoding of ternary addresses in binary CAMs. 

It is yet another object of this invention to provide an apparatus and method for 
obtaining the longest-match ternary address resolution in a single search cycle of a binary 
CAM. 

It is still yet a further object of this invention to provide an apparatus and method for 
maintaining a sorted routing table for search techniques for binary CAMs. 

It is still yet another object of this invention to provide an apparatus and method for 
maintaining a sorted routing table for ternary CAM. 

SUMMARY OF THE INVENTION 

These and other objects of the instant invention are achieved by providing a content 
addressable memory (CAM) of a communications system (e.g., data and 
telecommunication systems, network systems such as Internet Protocol, (IP), layer-3 
switches and asynchronous transfer mode (ATM) used in E.164 addressing) utilizing 
ternary hierarchical addressing and associated address masks. The CAM (e.g., a ternary 
CAM or a binary CAM) comprises a plurality of address entries and associated address 
masks that are arranged in the CAM by mask number, with address entries having the 
highest mask number being located at address entry locations at the top of the CAM and 
address entries having the lowest mask number being located at address entry locations 
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at the bottom of the CAM (known as "partial ordering"). The mask number is defined as the 
number of contiguous ones in an associated address mask. 

These and other objects of the instant invention are also achieved by providing an 
apparatus for storing a plurality of address entries and associated address masks in a 
communication system (e.g., data and telecommunication systems, network systems such 
as Internet Protocol, (IP), layer-3 switches and asynchronous transfer mode (ATM) used 
in E.164 addressing) utilizing ternary hierarchical addressing and associated address 
masks. The apparatus comprises a binary CAM and a binary-encoded ternary (BET) 
conversion means coupled to the binary CAM wherein the BET conversion means converts 
each ternary value into a corresponding BET value to form the address entries. 
Furthermore, the plurality of address entries and associated address masks are arranged 
in the binary CAM by mask number, with address entries having the highest mask number 
being located at address entry locations at the top of the CAM and address entries having 
the lowest mask number being located at address entry locations at the bottom of the CAM 
(known as "partial ordering"). The mask number is defined as the number of contiguous 
ones in an associated address mask. 

These and other objects of the instant invention are also achieved by providing a 
method for accelerating the routing of hierarchical addressing in a communication system 
which utilizes ternary hierarchical addressing and associated address masks (e.g., data 
and telecommunication systems, network systems such as Internet Protocol, (IP), layer-3 • 
switches and asynchronous transfer mode (ATM) used in E.164 addressing). The method 
comprising the steps of: (a) obtaining communication system hierarchical addresses and 
associated masks to form address entries; and (b) storing the address entries in a content- 
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addressable memory (CAM) by mask number wherein the mask number is defined as the 
number of contiguous ones in an associated address mask and wherein address entries 
having the highest mask number are stored in address entry locations at the top of the 
CAM and address entries having the lowest mask number are stored at address entry 
locations at the bottom of the CAM (known as "partial ordering"). 

These and other objects of the instant invention are also achieved by providing a 
method for maintaining a sorted CAM to enable longest matches in a single search cycle 
when hierarchical addresses are added to, or deleted from, the CAM in a communication 
system utilizing hierarchical addresses and associated address masks (e.g., data and 
telecommunication systems, network systems such as Internet Protocol, (IP), layer-3 
switches and asynchronous transfer mode (ATM) used in E.164 addressing). The method 
(known as "block-edge sorting") comprises the steps of: (a) segmenting the CAM into 
blocks wherein each block corresponds to a single hierarchical mask and wherein the 
blocks are arranged in the CAM such that the highest hierarchical masks are located at the 
lowest CAM addresses and the lowest hierarchical masks are located at the highest CAM 
addresses; (b) storing hierarchical addresses according to the block having a 
corresponding hierarchical mask; and (c) tracking (1) the first address of each of the blocks 
(floor); (2) the next free address of each of the blocks (nxtfree); and (3) the size of each of 
the blocks. 

DESCRIPTION OF THE DRAWINGS 
Other objects and many of the attendant advantages of this invention will be readily 
appreciated as the same becomes better understood by reference to the following detailed 
description when considered in connection with the accompanying drawings wherein: 
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Fig. 1 is a prior art telephone number translation table; 

Fig. 2 is the translation table of Fig. 1 re-arranged for conducting PATRICIA tree 
searching; 

Fig. 3 is the translation table of Fig. 1 augmented by particular hierarchical levels, 
thereby permitting parallel searching; 

Fig. 4 is a flow diagram for a prior art CAM search technique: Entries in Ternary 
CAMs with Hierarchical Level Field, Binary Search of Level Field; 

Fig. 5 is a flow diagram for another prior art CAM search technique: Full Sorted 
Order with Different Masks; 

Fig. 6 is a flow diagram for another prior art CAM search technique: Full Sorted 

Order; 

Fig. 7 is a diagram of the structure of valid network masks; 
Fig. 8 is a diagram of the structure of invalid network masks; 
Fig. 9 is a diagram of addresses and masks from a CIDR routing table; 
Fig. 10 depicts a CAM using partial ordering in accordance with the present 
invention; 

Fig. 1 1 depicts a CAM that does not use partial ordering; 
Fig. 12 is a flowchart for establishing partial ordering in a CAM; 
Fig. 13 shows a portion of a communication system where a CAM using partial 
ordering is used; 

Fig. 14 depicts an exemplary communication system which uses CAMs; 
Fig. 1 5 depicts a binary CAM with external binary-encoded ternary (BET) conversion 
means in a portion of a communication system using the present invention; 
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Fig. 16 depicts a binary CAM with internal binary-encoded ternary (BET) conversion 
means in a portion of a communication system using the present invention; 

Fig. 17 depicts a ternary CAM in a portion of a communication system using the 
present invention; 

5 Fig. 1 8 depicts the contents of the binary CAM when binary-encoded ternary (BET) 

conversion of the present application is applied; 

Fig. 19 depicts the binary encoding of ternary values; 
Fig. 20 depicts the CAM compare truth table with ternary values; 
Fig. 21 depicts the CAM compare truth table; 
10 Fig. 22 depicts the storage of the IP address/IP mask of Fig. 1 8 in bits; 

Fig. 23 depicts the relationship between the upper bits of Fig. 22 (C0.1 8.0C.1 5) and 
the lower bits (3F.E7.F3.EA) of Fig. 22, the latter of which is the one's complement of the 
upper bits; 

Fig. 24 depicts a plurality of IP addresses/IP mask entries stored in the binary CAM 
15 in accordance with the present invention; 

Fig. 25 depicts the compare results in accordance with the present invention; 
Fig. 26 is a flowchart for the method of the present invention for storing the binary- 
encoded ternary addresses in a binary CAM; 

Fig. 27 is a flowchart of a binary CAM operation during a search routine; 
20 Fig. 28 depicts insertion of a new hierarchical address entry into a non-full block; * 

Figs. 29a-29c together depict the method for insertion of a new hierarchical address 
entry using one edge sort; 
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Figs. 30a-30c together depict the method for insertion of a new hierarchical address 
entry using two sorts; 

Figs. 31a-31c together depict the method for insertion of a new hierarchical address 
entry recursive downward sorting; 

Figs. 32a-32c together depict the method for upward insertion of a new hierarchical 
address entry; 

Figs. 33a-33c together depict the method for upward insertion of a new hierarchical 
address entry using recursive upward sorting; 

Figs. 34a-34b together depict the method for deleting any entry; and 

Fig. 35 depicts an interleaving cycle for recursive sorting. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring now in detail to the various figures of the drawing wherein like reference 
characters refer to like parts, there is shown at 20 in Fig. 10, a CAM in which the address 
entries are arranged by mask number (i.e., the number of contiguous ones in the mask), 
from the highest mask number at the top of the CAM 20 to the lowest mask number at the 
bottom, irregardless of the sequence of the addresses themselves. This arrangement, 
hereinafter known as "partial ordering (PO)'\ is depicted in that the addresses A, B f and 
C are not in alphabetical order, but are stored in the CAM in groups having the same mask 
number and whereby these groups are stored in descending mask number order (which, 
by definition, is in increasing CAM address order). On the other hand, Fig. 1 1 depicts a 
CAM that does not use PO. Fig. 12 depicts a method 220 for optimally implementing a 
partially-ordered CAM 20, hereinafter known as "block-edge sorting (BES)", and will be 



-21- 



discussed in detail later. Suffice it to say for now that the "block-edge sorting" method is 
a specific application of PO. 

Partial ordering incorporates significant advantages over the number of masks used 
in hierarchical address schemes. The primary search advantage is that the address 
resolution/search is always completed in one CAM search cycle. Furthermore, the primary 
sorting advantage to this approach is that the worst case cost for inserting new entries is 
proportional to the number of masks (N mask ), and is not proportional to the number of 
entries. 

The crucial realization of this algorithm is that entries only need to be sorted by 
mask levels which means that ordering inside mask levels is irrelevant. Another advantage 
to the sorting method is that table updates can be successfully interrupted by searches as 
long as the current maintenance atomic operation (e.g., an insert) is allowed to completion. 

At this point, it should be understood that the CAM 20 includes both binary CAMs, 
as well as ternary CAMs, for storing the ternary hierarchical addresses. As such, it is within 
the broadest scope of the present invention that PO and BES can be implemented using 
either binary CAMs or ternary CAMs. Where a ternary CAM is used, the ternary 
hierarchical address being stored (e.g., C01 80000/1 6, the "C0 18" being hexadecimal 
values and the "16" indicating the number of "cares") would simply be stored as is (i.e., 
C018XXXX). Where a binary CAM is used, the additional steps of first converting the 
ternary hierarchical address into binary-encoded ternary (BET) and then storing the BET 
address in a unique manner (hereinafter, BETAS) in the binary CAM is discussed below. 
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Thus, the following discussion is directed to the more-involved situation where a binary 
CAM is used, it being understood that use of a ternary CAM would be a simpler case that 
would skip these additional steps. 

As shown in Fig. 13, the CAM 20 using PO forms a portion (e.g., an n-port router 
22 that utilizes the CAM 20 for forwarding decisions) of a communications system (e.g., 
network 24, Fig. 1 4). The CAM 20 interfaces with forwarding hardware 26 via a search data 
line 28 and routing decision line 30. The forwarding hardware 26 forms n network 
interfaces 32. It should be understood that the network 24 is exemplary only and that it is 
within the broadest scope of this invention to include the CAM 20 using PO in any type of 
communication system, e.g., telecommunication system, data communication system, etc. 

To place hierarchical entries in the CAM 20, it is necessary to sort their position in 
the CAM 20 so that the longest match is selected first. Since CAMs select the best match 
by lowest CAM address value, the entries with the longest match mask are stored in the 
lowest CAM addresses. If the following condition is met, a hierarchical address search is 
guaranteed always to resolve to the longest match entry in one CAM search cycle: 

For all CAM 20 entries n: mask(n)^mask(n+1) 
As will be discussed in detail below, the PO of the present invention ensures that the CAM 
20 entries are sorted properly so that searches always find the longest matching entry (i.e., 
the entry with the most number of T's in its mask). By ensuring that the CAM 20 entries 
are always ordered, the longest matching address is always found within a single compare 
operation. 

To implement the PO and BLS of the present invention using a binary CAM, for 
example, it is necessary to discuss how the BET (binary-encoded ternary) values are 
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stored in the binary CAM 20 (Figs. 18-26). Next, a flowchart describing the CAM 20 
operation during a search is shown in Fig. 27. Then the BES method 220 (Fig. 12) is 
explained using Figs. (28-35). 
BET Conversion/Storage in a Binary CAM (BETAS) 

To facilitate IPv4 CIDR, a CAM 20 is split into separate groups or blocks 
demarcated by IP mask values (Fig. 9). If a group of addresses that share the same mask 
are to be stored, there must be some way of encoding the "don't care" condition indicated 
by the address mask. Therefore, as discussed previously, two binary CAM bits are 
required to encode a single ternary (1 , 0, X) entry. To binary encode the ternary values 
(1, 0, X), 64 bits are required to store a 32-bit value. 

The basic principle is that the binary value and its complement are stored if 1 or 0 
is required, but when the "don't care" condition is needed, 0 and 0 is stored. Fig. 1 9 shows 
how the three ternary values are encoded.. 

The actual BET conversion can be accomplished for the binary CAM 20 either in 
external circuitry (Fig. 15, e.g., a separate IC 34A that is located external of the CAM 20 
or forms a portion 34B of the forwarding hardware 26) or in internal CAM 20 circuitry 36 
(Fig. 16) to achieve the result of making a binary CAM appear as a ternary CAM to the 
system, whereby an N-bit binary CAM word acts a N/2-bit ternary CAM word. The 
advantage of using external circuitry 34A or 34B, is that it can enhance system 
performance and increase maintainability. Alternatively, the advantage of using internal 
CAM circuitry 36 (e.g., Music Semiconductors, Inc., assignee of the present invention, 
markets a CAM device with a built-in binary to ternary converter, part no. MUAC4K64, all 
of whose product literature is incorporated by reference herein) is that doing so reduces 
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the required input/output bandwidth by a factor of two. Fig. 17 depicts the situation where 
a ternary CAM is used and the ternary addresses are passed directly to the ternary CAM. 

As an example of the BETAS of the present invention, the BETAS is used to store 
32-bit ternary values as 64-bit binary encoded ternary values in a 64-bit wide CAM. The 
high order bit of each 2-bit ternary value is stored in the upper 32 bits of the CAM entry 
while the low order bit is stored in the lower 32 bits of the CAM entry. For example, as 
shown in Fig. 22, to store C0.18.0C. 15, bits 63:60 (C hex ) of the CAM entry would be 1 100 
and bits 31 :28 would be 001 1 (3 hex ). This is equivalent to the ternary value 1 100 as the 
binary encoded equivalent 10 10 01 01. As can be seen most clearly from Fig. 23, the 
lower 32 bits are simply the 1's complement of the upper 32 bits. This unique storage 
mechanism (BETAS) stores binary complementary values n bits apart, i.e., the binary pair 
values at binary bit n and n+32, thereby making the BET conversion easy to compute using 
a general purpose processor. 

To form the groups of addresses with the same IP masks, the individual addresses 
have their masks added to them as they are inserted into the CAM 20 memory array. To 
store an IP address and IP mask in a 64-bit wide CAM 20 location, the bit-wise logical AND 
of the IP address and IP mask are stored in the upper 32 bits while the bit-wise logical AND 
of the mask and the address's 1's complement are stored in the lower 32 bits. Fig. 18 
shows how the IP address C0.18.0C.15 and the IP mask FF.FF.FF.FF would be stored in 
a CAM memory location according to the BETAS of the present application. Fig. 26 is a • 
flowchart of how the address/mask are stored in the CAM 20 in accordance with the 
present invention. 
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Search of the CAM 20 

The search method (Fig. 27) of the present invention also utilizes the BET scheme 
when searching the binary CAM 20 for a ternary value. It is necessary incorporate a CAM 
mask register when the compare is performed. To search the CAM memory array for the 
ternary value 1 , a "10" is loaded into both a Comparand register 38 and a Mask register 
40 (Fig. 20), which are both located in the CAM 20. The search would yield the results 
shown in Fig. 20. The ternary equivalent of each value is shown in parentheses after the 
binary-encoded value. The search resulted in the CAM entries 00 (X) and 10(1) matching 
the Comparand register 38 contents 10 (1). The contents of the Comparand register 38 
is effectively seen as X0 during the compare because the Mask register 40 is also loaded 
with 10. This makes the first bit of the binary encoded value a "don't care" and will 
therefore match with either 1 or 0. Similarly, a search for 01 (0) would produce a match 
with the CAM entries 01 (0) and 00 (X). This effectively gives a match result when there 
is a ternary X value, which is what is expected. 

When the Mask register 40 is used to mask out bits during the compare, it effectively 
allows ternary "don't cares" to be incorporated during a search. The Mask register 40 
works in the opposite way from an IP mask. When the Mask register bit is set to 1, the 
result will always be a match condition. Fig. 21 shows the Truth Table of CAM compares 
using the Mask register 40. 

There is enough information in each entry to determine the explicit mask/address 
pair that generated it. For example, to search for the entry CO. 18.0C. 15, a compare would 
be performed using C0.18.0C.15 as the upper 32 bits and its Vs complement, 
3F.E7.F3.EA, as the lower 32 bits. The Mask register 40 also is loaded with the same value 
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and is used when the compare takes place. The CAM Mask register 40 is used to 
determine which bits take part in the search and which bits are effectively ignored. As was 
seen in Fig. 20, the CAM mask works in the opposite way from an IP mask. This means 
that 1 corresponds to a "don't care" while a 0 corresponds to a "care" bit. 

Each address is logical ANDed with the appropriate mask value before being stored 
as binary encoded ternary. To perform a search operation using this searching method, 
the address being searched for is copied to the upper 32 bits of both the Comparand 
register 38 and the Mask register 40 and the 1's complement of the address is copied to 
the lower 32 bits. This again is loading the binary encoded ternary equivalent. Fig. 24 
shows how four 32-bit ternary values and their IP masks would be encoded and stored in 
a 64-bit CAM. 

Because of the ability to binary encode the ternary value X ("don't care"), the IP 
address and IP mask can be combined and stored as 64 bits. These 64-bit values are 
stored in the binary CAM 20 memory array in IP mask order. That means, the IP mask 
values that have the higher number of contiguous 1s are stored in the higher priority 
locations. How the CIDR entries are sorted into blocks, will be discussed later. 

If IP address C0.18.0C.1 5 were encoded in the normal manner and compared with 
the four entries in Fig. 24, each entry would produce a match result. If the entries were 
sorted properly, the highest priority match would be entry number 1 (since that IP address 
has the highest number of contiguous 1s in its IP mask and was stored in the CAM in the 
higher priority location), which would be the desired result. Fig. 25 shows the results of the 
comparison between C0.18.0C.15 and entry number 3, i.e., C0.18.0C.00. This shows how 
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the combination of the binary encoded ternary values and the use of the Mask register 40 
work to produce a match result. 

An exemplary CAM 20 is the MUSIC Semiconductors WidePort LAN CAM (all of 
whose product literature is incorporated by reference herein) which has a 32-bit I/O bus. 
This allows the 64-bit binary encoded ternary values to be loaded into the Mask register 
40 in two cycles. The search procedure consists of the following: 

1 . Two Short cycles - Load IP Address to bits [63:32] of Mask register 
and load NOT (IP Address) to bits [31:0] OF Mask register. 

2. One Short cycle - Copy the Mask register to the Comparand register. 

3. One Long cycle - Initiate a comparison with the CAM memory array 
through the Mask register. 

4. One Medium cycle - Read out the CAM array address of highest- 
priority match. This address would be used as an index to access 
associated data stored in external RAM. 

V 

Using these five cycles, the search time required is as follows: 

50ns speed grade CAM: 210ns with a 15ns clock, or 476 million searches per 
second. 

70ns speed grade CAM: 280ns with a 20ns clock, or 3.57 million searches pre 
second. 

Block-edge sorting Method 

In Fig. 12, there is shown a flowchart for optimally implementing a partially-ordered 

CAM 20, which has been referred to as* "block-edge sorting (BES) W . Thus, the goal of 

partial ordering is to maintain a sorted CAM 20 to enable longest matches in one search 

cycle. Although applicability of BES is being described for use with a binary CAM, BES 

can also be used with ternary CAMs. 
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As stated previously, in order to obtain the longest match using either a binary CAM 
20 or a ternary CAM, it is necessary to maintain the CAM 20 so that the ternary addresses 
with the larger mask numbers are stored in the higher priority addresses of the CAM 20. 
This condition can be summarized by the statement: 

For any entry at address n, mask(n) ;> mask (n+1 ) 
As long as this condition is true for all addresses, the longest match is assured in one 
search cycle for lowest-address priority CAM's 20. Fig. 10 shows a properly sorted CAM 
20 that conforms to the condition mask(n);>mask(n+1). The letter in the notation 
"letter/number" can be interpreted as the address while the number corresponds to the 
hierarchical mask level. For instance, "A/32" matches "A/30" up to the 30 th bit and is more 
of an exact match. A search for hierarchical address "A" would match "A/32" first. In 
contrast, Fig. 11 shows CAM with the condition rhask(n)*mask(n+1) violated. Asearchfor 
"A" would get u A/30" instead of "A/32", which is not the longest match and is incorrect. 

The difficult task is to ensure this condition always exists as addresses are added 
to and deleted from the CAM 20. The simple approach is to empty the CAM 20 every time 
it has to be updated and then reinsert every entry. This can be a time-consuming process 
especially if the routing tables are not static and subject to dynamic changes. The time to 
empty and refill the routing tables in the CAM 20 using this method could significantly 
decrease system performance and bandwidth. 

A unique solution to this problem is to use a block-edge sorting algorithm (Fig. 12) m 
that enables real-time modification of the CAM 20 routing table while reducing the time the 
CAM 20 cannot be searched to exceptionally low levels. 
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The block-edge sorting method takes a CAM 12 (or array of CAM's) and segments 
the memory space in blocks. The block characteristics can be represented in software or 
hardware as pointers and flags. Each block corresponds to one and only one hierarchical 
mask. For example, IPv4 has at most 32 unique masks which would be represented by 
32 CAM blocks. Hierarchical addresses are stored in their corresponding mask block. The 
order of the entries within each block is unimportant and is not kept track of because it is 
possible to get no more than one unique match from each block. If more than one match 
does occur from a block, it can only mean that there are multiple identical hierarchical 
address entries in that block. The block-edge sorting method avoids the need to empty and 
refill the CAM every time a new entry appears by reducing the worst case to shuffling 
entries at the edges of each block. This method can be performed either in an iterative 
or recursive manner. 

The block characteristics consist of the: 

Floor(f/): starting address index (known hereinafter as the "floor pointer"), which 
indicates the first address in the block; 

Nxtfree (n/): the next free entry address index (known hereinafter as the "next-free 
pointer"), which indicates the next address to place an entry; 

Blocksize: the block size of remaining entries; and 

Backpressure flag: a flag or flags indicating the status of the fullness of the adjacent 
blocks in the hierarch>r(known hereinafter as the "back-pressure" flags), which 
indicates that this and all other blocks "below" are filled. 

The floor keeps track of the lowest address entry in the hierarchical block. The next-free 

pointer indicates the next destination address for new address entries. The block size 

value in conjunction with the next-free pointer and the floor-pointer keeps track of whether 
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or not the block is full. Alternatively, a remaining entries counter represents the same 
information. 

The first case for inserting a new hierarchical address entry is when the destination 
block has at least one free entry left. In this case, the new address is inserted at the 
location indicated by the next-free pointer for the block. This case is shown in Fig. 28, 
where the new address is inserted and the next-free pointer is incremented to properly 
indicate the next position for inserting entries. 

If the sizes of each hierarchical block are carefully chosen so as to accurately 
represent the mask content of the routing tables being stored, the majority if not all of the 
address inserts will be of the simple default type described in Fig. 28. However, the block- 
edge sorting method expects this not to be the case and reacts accordingly to the cases 
where a new address entry is being inserted into a block that does not have any free 
entries left. 

The next case, illustrated in Figs. 29a-29c, concerns when the destination block is 
full. When the destination block is full, space must be allocated above or below the block 
to accommodate the new entry. In this example, of the block-sorting method and all 
following illustrations, the block-edge sorting method prefers to steal space from the lowest 
hierarchical mask block adjacent to the one in question. However, the method applies in 
either initially stealing in the upwards or downwards direction and loses no generality when 
illustrated in the downwards direction. An optimal implementation might choose to sort in • 
the direction of the nearest free space, either above or below the full block in question. The 
downwards direction is selected only to be aesthetically consistent with the notion that 
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higher mask priorities have precedence over lower mask priorities and the probability that 
there will be more entries with higher mask priorities. 

In Fig. 29a, a new entry is shown with hierarchical mask level 32 being inserted into 
block 32, which is represented by the floor-pointer "f/32" and the next-free pointer "n/32", 
and the block size equal to three, which can be inferred from the diagram by the block 
spacing. Block 32 is full, so the sorting algorithm looks at block 31 , the adjacent block with 
lower mask priority. Block 31 is not full and the back-pressure flag is not set, therefore, 
there is a space in block 31 to steal to accommodate block 32. Fig. 29b shows the floor 
entry A/31 being moved to the next-free location in block 31 . After A/31 is moved, so the 
new entry is moved to occupy the newly-freed location. The pointers and block-sizes are 
adjusted to properly reflect the changes. In this case, block 32's size increases to 4 while 
block 31's size decreased to 3. Note that A/31 is now after B/31 which has no effect 
whatsoever upon the proper sorting in the CAM 20 because the internal ordering of blocks 
is irrelevant as long as the entries in the block have the same hierarchical mask numbers. 

Figs. 30a-30c and Figs. 31 a-31 c show how the insertion method extends to multiple 
blocks. In Figs. 30a-30c, the new entry is destined for block 32, which is full. There is at 
least one space somewhere between that block and the highest memory address because 
no back-pressure flags are set. Due to this fact, the robbing will ripple downwards until it 
terminates successfully upon the first free entry. The moves are executed "backwards" 
from the terminating block to the insertion of the new entry. This has to be done in order . 
to prevent entries from being overwritten before they are copied to their new location. In 
the case for Figs. 30a-30c, A/30 is moved to n/30, then A/31 is moved to f/30, then new/32 
is moved to f/31. Figs. 31 a-31 c show that this sorting can scale to any number of blocks. 
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The first free entry below block 32 is in block 25. In this case, the terminating move is the 
move of A/254 from f/25 to n/25. Block 32's steal carries all the way down to block 25, 
effectively stealing space from it as the only changes in block-sizes are in block 32 and 
block 25 where block 32 increases to 4 and block 25 decreases to 3. 

A special case to consider is when an entry is to be added to a full block and the 
blocks below are also full When this occurs, entries cannot be swept further "downwards" 
to produce an empty location in the required block. Instead, they must be pushed up if 
there is any free space above. See Figs. 32a-32c. 

A "Backpressure" flag, mentioned earlier, is set in the last block when there is no 
more space available. The meaning of the "backpressure flag" is this: if it is not set, there 
is at least one more free entry somewhere between the current block and the highest 
physical memory location. If it is set, then all blocks below the current block are full. 
Blocks attempting to push downward check this flag and, if it is set, are forced to try to 
push entries upward. In this case, recursion downward into the blocks would not work 
because there is no more free memory. Instead, recursion must go upwards. The same 
sorting routine is applied here with the difference being that the sorting goes in the 
opposite direction. Figs. 33a-33c show an example of recursive sorting upwards. 

In Fig. 33a, a new entry is destined for the last block but it is full, requiring the edge 
sorting to go upward instead of downward. Secondly, Fig. 33b shows the moves and 
inserts that will be done to free an entry space in the final block. As before, the Routing - 
database sorted in the CAM will always yield a correct search result due to the order of the 
moves and inserts. Finally, Fig. 33c shows the blocks as represented by their readjusted 
pointers and block size values. 
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In order to delete entries from the CAM routing table and reuse the freed addresses, 
it is necessary to move the last entry in the block to the address being deleted. Figs 34a- 
34b shows the entry B/32 being removed form the routing table. First, E/32 is copied to 
B/32*s address, overwriting and effectively deleting B/32. Then the entry E/32 at the 
original location is deleted and the next-free pointer is decremented to properly reflect the 
new spaces. 

Depending upon how the routing table updates are performed, it may be necessary 
to search for explicit IP address/mask combinations to determine if an entry is already in 
the CAM 20. In this case, the longest match is not what is being looked for. Therefore, the 
search procedure is slightly different from the one used to locate the longest match. When 
searching for the longest match, the Comparand register 38 contents are duplicated in a 
Mask register 40 and the search is performed with the relevant bits masked out. 

When searching for explicit entries ihere is no need to use the Mask register 40. 
The (ADDR & MASK), (!ADDR & MASK) representation of the IP address/mask pair is 
loaded into the Comparand register 38 and a compare is initiated. The address/mask pair 
is compared with the entries in the CAM 20 memory array. This search will only yield a 
match if the CAM 20 entry exactly matches the entry in the Comparand register 38. The 
LANCAM family has foreground and background register sets, which can make this task 
easier. 

The foreground registers should be set to do loading and searches for the longest • 
matches as described earlier while the background registers should be set to perform 
explicit CAM searches that would perform compares without using a Mask register 40. It 
is then only necessary to switch between the two register sets as needed. 
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It is possible to interleave operations of the CAM 20 in order to keep the table 
updated while continuing to perform high bandwidth searches. For searches, the atomic 
operation is the five instruction load/load/copy/search/read index sequence. The insertion 
operation can be broken up into different atomic operations that can be interleaved in 
between gaps in the atomic search operations. The following enumerate the different 
atomic insert operations: 

1. Insert to address: 

Two short cycles: load (ADDR & MASK) and (IADDR & MASK) 
One short cycle: insert to address 

2. Read out entry to be copied/moved 

Fig. 35 shows how the four inserts (I) and three copies (C) required in the previous 
example on recursive sorting could be queued and interleaved in between address 
searches (S). No operation is indicated by.(N). The worst case number of cycles for this 
can be calculated assuming that the new entry is inserted to the first block but all blocks 
except the last are full. For k partitions there are: 

k -1 CAM read cycles 

k - CAM write cycles 

k - RAM read/write cycles 
An important point to note is that since k entries are moved, the overhead of moving 
their associated data in RAM must also be considered. However, this worst case only . 
happens once; with thoughtful initial block allocation, inserting a new entry into the CAM 
will take one write cycle. A good precautionary measure to avoid long sorting operations 
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is to adjust the block size and the spacing during idle periods to avoid shuffling during busy 
periods. 

If the associated data entries stored in RAM are more than a few bytes in length, 
a scheme using a second section of RAM to incorporate pointers might be considered The 
pointers would point to the associated data entries of each CAM 20 entry, which would 
avoid moving large blocks of data. When the associated data entries are shuffled, it is a 
simple case of rearranging the pointers. This would remove the need to move the 
associated data around, which will reduce the associated data maintenance time. 

The methods described in present application are flexible enough to be used with 
different styles of updating the routing tables and therefore allows a system designer or 
software engineer the ability to use the most suitable method available. Using the 
algorithms described in the present application in conjunction with dynamic routing table 
updates would be more efficient than reloading the routing table in the CAM and refilling 
it. This is because search operations could still be performed during the updates. 

Without further elaboration, the foregoing will so fully illustrate our invention that 
others may, by applying current or future knowledge, readily adopt the same for use under 
various conditions of service. 
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